NSF PAR Search | NSF Public Access Repository

Logistic-Beta Processes for Dependent Random Probabilities with Beta Marginals

https://doi.org/10.1214/25-BA1541

Lee, Changwoo J; Zito, Alessandro; Sang, Huiyan; Dunson, David B (December 2025, Bayesian Analysis)

Free, publicly accessible full text available December 1, 2026
BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference

Lee, Changwoo; Kwon, Soo Min; Qu, Qing; Kim, Hun-Seok (December 2024, Advances in Neural Information Processing Systems)

Large-scale foundation models have demonstrated exceptional performance in language and vision tasks. However, the numerous dense matrix-vector operations involved in these large networks pose significant computational challenges during inference. To address these challenges, we introduce the Block-Level Adaptive STructured (BLAST) matrix, designed to learn and leverage efficient structures prevalent in the weight matrices of linear layers within deep learning models. Compared to existing structured matrices, the BLAST matrix offers substantial flexibility, as it can represent various types of structures that are either learned from data or computed from pre-existing weight matrices. We demonstrate the efficiency of using the BLAST matrix for compressing both language and vision tasks, showing that (i) for medium-sized models such as ViT and GPT-2, training with BLAST weights boosts performance while reducing complexity by 70% and 40%, respectively; and (ii) for large foundation models such as Llama-7B and DiT-XL, the BLAST matrix achieves a 2x compression while exhibiting the lowest performance degradation among all tested structured matrices. Our code is available at https://github.com/changwoolee/BLAST.
Full Text Available
Learning-Based Near-Orthogonal Superposition Code for MIMO Short Message Transmission

https://doi.org/10.1109/TCOMM.2023.3274158

Bian, Chenghong; Hsu, Chin-Wei; Lee, Changwoo; Kim, Hun-Seok (September 2023, IEEE Transactions on Communications)

Full Text Available
Why the Rich Get Richer? On the Balancedness of Random Partition Models

Lee, Changwoo; Sang, Huiyan (July 2022, Proceedings of Machine Learning Research)

Random partition models are widely used in Bayesian methods for various clustering tasks, such as mixture models, topic models, and community detection problems. While the number of clusters induced by random partition models has been studied extensively, another important model property regarding the balancedness of partition has been largely neglected. We formulate a framework to define and theoretically study the balancedness of exchangeable random partition models, by analyzing how a model assigns probabilities to partitions with different levels of balancedness. We demonstrate that the "rich-get-richer" characteristic of many existing popular random partition models is an inevitable consequence of two common assumptions: product-form exchangeability and projectivity. We propose a principled way to compare the balancedness of random partition models, which gives a better understanding of what model works better and what doesn’t for different applications. We also introduce the "rich-get-poorer" random partition models and illustrate their application to entity resolution tasks.
T-LoHo: A Bayesian Regularization Model for Structured Sparsity and Smoothness on Graphs

Lee, Changwoo; Luo, Zhao Tang; Sang, Huiyan (December 2021, Advances in Neural Information Processing Systems)

Graphs have been commonly used to represent complex data structures. In models dealing with graph-structured data, multivariate parameters may not only exhibit sparse patterns but have structured sparsity and smoothness in the sense that both zero and non-zero parameters tend to cluster together. We propose a new prior for high-dimensional parameters with graphical relations, referred to as the Tree-based Low-rank Horseshoe (T-LoHo) model, that generalizes the popular univariate Bayesian horseshoe shrinkage prior to the multivariate setting to detect structured sparsity and smoothness simultaneously. The T-LoHo prior can be embedded in many high-dimensional hierarchical models. To illustrate its utility, we apply it to regularize a Bayesian high-dimensional regression problem where the regression coefficients are linked by a graph, so that the resulting clusters have flexible shapes and satisfy the cluster contiguity constraint with respect to the graph. We design an efficient Markov chain Monte Carlo algorithm that delivers full Bayesian inference with uncertainty measures for model parameters such as the number of clusters. We offer theoretical investigations of the clustering effects and posterior concentration results. Finally, we illustrate the performance of the model with simulation studies and a real data application for anomaly detection on a road network. The results indicate substantial improvements over other competing methods such as the sparse fused lasso.
Full Text Available
